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Abstract. 

One of the most important changes in observational astronomy of the 21 st Century is a rapid 
shift from classical object-by-object observations to extensive automatic surveys. As CCD detec- 
tors are getting better and their prices are getting lower, more and more small and medium-size 
observatories are refocusing their attention to detection of stellar variability through systematic 
sky-scanning missions. This trend is aditionally powered by the success of pioneering surveys 
such as ASAS, DENIS, OGLE, TASS, their space counterpart Hipparcos and others. Such sur- 
veys produce massive amounts of data and it is not at all clear how these data are to be reduced 
and analysed. This is especially striking in the eclipsing binary (EB) field, where most frequently 
used tools are optimized for object-by-object analysis. A clear need for thorough, reliable and 
fully automated approaches to modeling and analysis of EB data is thus obvious. This task is 
very difficult because of limited data quality, non-uniform phase coverage and solution degen- 
eracy. This paper reviews recent advancements in putting together semi-automatic and fully 
automatic pipelines for EB data processing. Automatic procedures have already been used to 
process Hipparcos data, LMC/SMC observations, OGLE and ASAS catalogs etc. We discuss 
the advantages and shortcomings of these procedures. 

Keywords, methods: data analysis, numerical; catalogues, surveys; binaries: close, eclipsing, 
fundamental parameters; techniques: photometric, spectroscopic 



1. Introduction 

Doing astronomy today is simply unimaginable without computers. To facilitate ob- 
serving preparations, we use databases; to observe, we use control software; to reduce 
the acquired data, we use reduction programs. Just how far the computer autonomy of 
the data acquisition process goes is best described by the increasing trend of rcfurbrish- 
ing small and m edium-size telescope s into full y automatic, robotic instrument jjj Surveys 
such as OGLE (lUdalski et aZj |l997lL EROS (|Palanaue-Delabrouille ef"oDll998h - ASAS 



( Poimans ki 2002). space mission Hipparcos' epoch photometry (jPerrvman fc ESAlll997l ). 



and others, have changed observational astronomy: streams of data produced by auto- 
matic telescopes around the world and in space are overwhelming for currently existing 
tools and astronomers cannot cope anymore. 

Take eclipsing binaries, for example. So far there have been about 500 published pa- 
pers with physical and geometrical parameters determined to better than 3% accuracy. 
For a skilled eclipsing binary guru it takes 1-2 weeks to reduce and analyse a single 
eclipsing binary by hand. To date, there are about 10000 photometric/RV data-sets that 
in principle allow modeling to a 3% accuracy. By 2020, the upcoming missions such as 



f A comprehensive list of more than a hundred such facilities may be found, e.g., at 
http : //www. astro .physik.uni-goettingen. de/~hessman/MONET/links .html 
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Figure 1. Schematic view of a typical EB reduction and analysis pipeline. 



Pan-Starrs (jKaiser et aZ.1 120021 ) and Gaia (jPerrvman et aLll200lh will have pushed this 
number to ^10000 000. Even if all observational facilities collapsed at that point so that 
no further data got collected, it would take 12 500 astronomers to analyse these data 
in the next 100 years! Since there are currently about 13 000 members of the IAU, the 
only way to achieve this in the next 100 years by traditional methods is to have every 
astronomer in the world doing eclipsing binaries. And of course, do not forget to shut 
down all robotic telescopes out there! 

With the change in observational astronomy, traditional analysis methods and tools 
need to change too. This paper overviews most important aspects of automatic proce- 
dures, tiers that form a pipeline reduction of eclipsing binary light curves. Next Sec- 
tion deals with basic principles of the reduction and analysis pipeline; Section 3 reviews 
most important applications of automatic pipelines on large-scale survey data. Section 4 
stresses the everlasting importance of dedicated observations. Finally, Section 5 concludes 
and gives some prospects for the future. 



2. Tiers of the reduction and analysis pipeline 

A full-fledged pipeline for reduction and analysis of photometric data of eclipsing binary 
stars would ideally consist of 8 distinct tiers depicted in Figure [TJ 

2.1. Intention and planning 

For as long as we discuss stellar objects in general, and eclipsing binaries in particular, 
there are two apparently frightening facts that need to be considered: 1) a target star 
has already been observed and 2) a target star has already been observed many times. 
There are literally hundreds of photometric survey missions that have been swiping the 
sky across and over in a very wide magnitude range, and chances are indeed slim that a 
given star has not been observed yet. 

According to Hipparcos results, there are about 0.8% of eclipsing binar ies in the 
overall stellar population (917 out of 118 218 stars. iPerrvman fc ESAl I1997T) . Project- 
ing these statistics to other large surveys gives an estimate of how many eclipsing bi- 
naries are expected to be prese nt in survey databases: ^136 000 in ASAS (11076 de- 



tected bvlPaczvnski et a/.ll2006h . ~ 56 000 in the OGLE LMC field (2 5 80 detected by 
Wvrzvkowski et aZ.li2 003) , ~ 16 000 in OGLE SMC field (1 350 detected bv lWvrzvkowski etal 
2004 . ~80 000 in TASS (|Droege et aZ.ll2006l ) etc. Gaia will make a revolution in these 
numbers since the aimed census of the overall stellar population is ~ 1 billion up to 
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V = 20 IjPerrvman et gj] |200lh . Admittedly, magnitude levels and variability detec- 



tion threshold change from survey to survey, but a shortage of eclipsing binaries in the 
databases is more than obvious. In other words, there are many eclipsing binaries out 
there that are either undetected, unconfirmed or misclassified. Stressing a well-known 
fact that eclipsing binaries are unique in their potential to yield accurate masses, radii, 
temperatures and distances, and realizing that many of them are reachable by small-size 
ground instruments, eclipsing binaries should definitely hold one of the top positions on 
observational candidates list. 

2.2. Data acquisition 

Most automated of all pipeline tiers, data acquisition has become a truly reliable run- 
of-the-mill. An example of a fully automatic data acquisition a nd analysis pipeline is 
that of the All-Sky Automated Survey (AS AS, |Poimanskilll997t ). depicted on Figure 



The level of sophistication is already such that it assures accurate and reliable data both 
from ground-based and space surveys - and in plenty. A more serious problem for space 
surveys seems to be telemetry: how do we get the data down to Ea rth? For instan ce, 



Hipparcos' downlink rate was only 24 kbit/s, Gaia's will be 5 Mbit/s (|Lammersll2005l ) - 
significantly less than the bandwidth we are used to from everyday life. To avoid using 
lossy compression algorithms, surveys must use optimized telemetry for the given field 
and/or data pre-processing (e.g. binning, filtering, selective downloads). Reliable and 
lossless I/O pipelines and finding ways to store all the acquired data are definitely two 
of the greatest challenges for data acquisition of the future. 

2.3. Raw image reduction 

Acquired data must be reduced: two-dimensional images must be converted to the ob- 
served quantity (magnitudes, fluxes, . . . ). To fully appreciate the need for accurate image 
reduction, one must consider a multitude of physical and instrumental effects that influ- 
ence the observed data. Some of them - e.g., telescope optics, CCD quantum efficiency 
and non-linearity, filter response - may be adequately treated during the reduction pro- 
cess. Others - sky variability, instrumental temperature dependence, cosmic rays, inter- 
stellar and atmospheric extinction - usually demand more involvement because of their 
dependence on time and wavelength, or because of unknown physical conditions. Raw im- 
age reduction consists of taking the acquired image, extracting the data and removing all 
instrumental artif acts contain ed in that data. This procedure, along with the developed 
tools (e.g., IRAF, Todv 1986t k relies somewhat on human intervention, but in principle 



it could be automated to meet the accuracy of today's surveys. One of the steps in the 
ASAS pipeline, for example, i s a fully automat ed reduction (c.f., Figured]): subtracting 
dark current and fiat-fielding ( Poimanski 19971 ). 



2.4. Pre-processing 

Once the images have been reduced, the data are ideally free from instrumental system- 
atics, but imprints of other effects (most notably atmospheric extinction and variable 
seeing) in phased data are still present. These effects may be significant and, as such, 
they should be removed from the data. To this procedure we refer to as pre-processing. 

There are two approaches to pre-processing: parametric modeling and detrending. The 
former uses modeling functions and seeks optimal parameters to reproduce the effect at 
hand; since it relies on physical insights, its application is more-or-less transparent. De- 
trending, on the other hand, is based on statistical properties of the observed time series 
and uses mathematical tools to achieve t he same goal. Tr e ating a tmospheric extinction 
with parametric models is given e.g., bv IPrsa fc Zwitter ( 2005bl ). while detrending is 
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Figure 2. Automatic pipeline of the ASAS project. The pipeline consists of two separate (yet 
connected) engines: Observer and Analyst. The Observer takes care of the data acquisition, and 
the Analyst takes care of data reduction and analysis. The only human intervention needed 
is closing the observatory in case of bad weather and changing the D AT-2 storage tape s. The 
schematic view was adapted from the description of the ASAS project (jPoim anski 199J). 



presented e.g. bv lTamuz et al. (2005). Since the application of the latter is not limited 



to just a given physical effect, it is well worth stressing its major strengths. 

Strictly speaking, a trend in a time series is a slow, gradual change in observables that 
obscures parameter relationships under investigation. Detrending is a statistical opera- 
tion of removing stochastical dependence in consecutive observations, thus making the 
pre-processed data distribu t ed ac cording to the normal (Gaussian) probability distribu- 
tion function. iTamuz et al. (120051) proposed a generalized Principal component analysis 
(PCA) method that accounts for variable observation uncertainties. The method is able 
to remove systematics from the data without any prior knowledge of the effect. Figure [3] 
shows an example of how the algorithm is able to process noisy planetary transit data (top 
row) by consecutively detrending four distinct systematic effects, yielding the detrended 
data (bottom row). Strengths of the method are its universality and little importance of 
the starting values of trend parameters, and the reduction to ordinary PCA in case of 
constant observation uncertainties. Its deficiencies are non-orthogonal eigenvectors (and 
thus deteriorated statistical properties in cases of a highly variable S/N ratio), a danger of 
filtering out intrinsic long-term variability and no relation to physical background of the 
trend. That said, generalized PCA method has proven to be one of the most successful 
methods for detrending that has been applied so far. 
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Figure 3. Detrending results for 3 planetary transit examples. The plotted diagrams depict 
relative magnitude vs. phase. The top row shows the original data, and t he bottom row show s 
the detrended data, after 4 consecutive detrending iterations. Adapted from lTamuz e t al. (2005). 

2.5. Processing/classification 

By the time the observed data is ready for scientific munching, most of the non-intrinsic 
artifacts should have been removed. By processing we refer to seeking broad scientific 
properties of the observed object: analysis of variance, period determination, phased curve 
folding etc. Closely related is classification: based either on the processing results or on 
statistical pattern analysis, the observed objects are classified into their respective groups. 
While manual approaches usually rely on the former principle (we recognize the shape 
of the light curve and evaluate it critically in a broader physical context - do parameters 
make sense, is the period plausible for a given type of object, . . . ), automatic approaches 
will prefer the latter principle, e.g., through the use of Fourier fitting, inversions, neural 
networks etc. Either way, processing and classification aim to discriminate gems from 
ordinary rocks in terms of our primary interest. 

2.6. Modeling and analysis 

Computationally most demanding task, at least with respect to eclipsing binaries, is 
their modeling and analysis. Seeking and interpreting a set of physical and geometrical 
parameters involves solving the i nverse problem. There are many d edicated cod es that 
enable accurate model ing - WD dWilson fc Devinne vl ll97ll). WINK dWoodl ll97ll). NDE 
( Nelson fc Davislll972r ). EBOP (|Etzellll98ll ). FOTEL (|Hadravalll990l ) and many others. 
We discuss their usage and application to survey data in detail in the following section. 

2.7. Sanity check 

A famous statement by R.E. Wilson, "There is more to modeling eclipsing binaries than 
parameter fitting," pretty much encapsulates the idea of sanity check. Solving the in- 
verse problem does not only mean finding physical and geometrical parameters that best 
reproduce the data, it also means seeking parameter inter-dependencies, understanding 
hyperspace non-linearity and, above all, being aware of the limitations of the data-set 
at hand and the used modeling engine. Since eclipsing binaries are used for "calibrating 
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the calibrations" , mis- and over-interpreting the data may have tragic consequences on 
solution reliability. Getting a solution from a model is only a fraction of the work; the 
majority is assessing its uniqueness and physical feasibility of that solution. 

2.8. Archiving and publication 
More important than the publication of papers themselves is the question on publishing 
data. What to do with the immense data flow that is expected from large-scale surveys? 
How to set standards and specifications for publishing and storing data? How to coordi- 
nate efforts and how to distribute the results? Finally, what is o ur next step in term s of 
model e nhancements? Le t us face it - missions such as CoRoT ([Baglin et all 120021 ) and 
Kepler (jKoch et al\\2Q04\ will deliver milli- magnitude accuracies in just a few years - do 
we honestly believe that our models can support such accuracies? All of these are still 
open questions that demand our immediate attention. 



3. First bites on large databases 

One of the first attempts to survey eclipsing binaries in the LMC goes back to lPavne-Gaposchkin 
( 19711 ). who visually examined about 2000 photographic plates, and classified and listed 
the main characteristics of 78 eclipsing binaries. At that time computers only started 
infiltrating modern astronomy and automatic handling was not possible. Y et at th e same 

time, the first EB modeling cod e s were emerg i ng, most notably thos e of HoralJ (1 1966 , 

19701) IWils on fc Devinnevl jl97ll).IWoodl (ll97lli.lNelson fc Davidd 19721 ) iMochnacki fe Doughty 



19721 ) and somewhat laterlHilll (|l979l ) . lEtzell (|l98lh . lHadraval (|l990f) and lLinnell fc Hubenvl 



1994), that would eventually form the base of automatic pipelines. 

In the early nineties , surveys began to yield first databases that were used for EB 
detection and analysis. iGrison et all ( 19951 ) assembled a list of 79 EBs in the bar of the 
LMC from the EROS survey data. Of those, only one system was previously identified 
as an EB, so this w ork effectively doubled th e number of known EBs in the LMC. In the 
year that followed, Friedemann et al. ( 19961 ) used IRAS data ( Neugebauer et al. 19841 ) 
to look for coincidence s in the positions of EBs taken from the 4 th edition of the GCVS 
( Kholopov et aZ.I Il992h and about 250 000 IRAS sources. They found 233 candidates, 



of those 63% Algol-type binaries where accretion disks could be responsible for the IR 
imprint. 

Attacks on LM C continued bv lAlcock et al. (1997), who used the MACHO database 
( Cook et oi.lll995f) to analyse 611 bright EBs. The selection was based on visual identifi- 
cation by examining phase plots. They pointed out two physical quantities that, besides 
inclination, account for most variance in light curves: the sum of relati ve radii and the 
surfac e brightness ratio. For preliminary analysis the authors used the Nelson fc David 
( 19721 ) code and, following the GCVS designation types, they proposed a new decimal 
classification scheme depicted in Figure [4] 

The next survey to provide results for 933 EBs was OGL E ( Szvmanski et al. 19961 ). 
Series of systematic analys e s were conducted by iRucinski (ll997bU Il998l) and later 



Maceroni fc Rucinski ( 19991 ); Rucinski fc Maceronil (|200lf ) that stressed the success and 



importance of the Fourier decomposition technique (FDT) for classification of variable 
stars. The technique itself - fitting a 4 th order Fourier series to phased data curves and 
mapping different types of variable s in Fouri e r coeffi cient space (c.f., Figure El left) - was 
first proposed for EBs already bv lRucinski (Il973 ) and has been used eve r since, most 
notably for classifying AS AS data ( Poimanskil 20021 IPaczvhski et al. 20061 ). 

Somewhat ironically, the first one to implement a fully automatic analysis pipeline for 
obtaining physical parameters of EBs was the most vocal advocate against any automated 
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Figure 4. A decimal classification scheme proposed bv lAlcock et al\ . The scheme relies of two 
physical parameters: the sum of relative radii and the surface brig htness ratio. F our plots on the 
right are classified data from the MACHO survey. Adopted from lAlcock et all (|1997l ). 
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Figure 5. Left: three types of eclipsing binaries (detac hed, semi-detached, and contact) mapped 
in the 02-04 Fourier composition space; adopted from Paczvhs ki et al\ ((2006). Right: heuristic 
scanning with Powell's direction set method. Converged results are shown for different parameter 
cross-sections; cross-hairs denote the right solution, and the symbol's shade of g ray corresponds 
to the reached \ 2 value: the darker the tone, the lower the x 2 . Taken from Prsa & Z witter 
(120061 1. 



approaches: R.E. Wilson. In their two papers. IWvithe fc Wilson! (|200l[ 120021 ) carried out 
an automatic search from 1459 EBs in the SMC detected by OGLE to find ideal distance 
estimators. WD was run in an automatic mode for the first time, although on a stripped 
level of complexity: the model assumed canonical values for physical parameters poorly 
defined by a single-passband photometric data: mass ratio q = 1, argument of periastron 
uj = or 7r, the temperature of the secondary T2 = 15 000K, no spots, simple reflection, 
synchronous rotation etc. Yet for the first time, an automatic, decision-making pipeline 
was tested against synthetic data and then applied to observations. Despite several de- 
ficiencies (systematics introduced through assertions, DC-based method without heuris- 
tical search for solution uniqueness, no account of reddening) the authors succeeded to 
come up with two groups of candidates for ideal distance indicators: widely detached EBs 
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Figure 6. Left: an example of conversion of phased light curves to 70x15 pixel images, which 
are fe d to the neural network image recogniti on algorithm . Taken from IWvrzvko wski et al\ 
(2003). Right: a tier-based pipeline proposed bv lDevorl (|2005T ): observed light curves are passed 
sequentially through filters and only the ones that fulfil all criteria make it to the next tier. 



and EBs with total eclipses. A manual follow-up analys is of 19 bright, large-amplitude 
candidates in their list was done e.g., bv lGraczvk ( 2003 ). deriving the distance modulus 
to the SMC to be ~18.9±0.1. 

Meanwhile, a number of reliable solutions of individ ual EB solutions was steadily g row- 
ing by a dedicated series of manual analyses, e.g., by I Andersen et all iMunari et al\ and 



others. Instead of immediately going for survey data, our group d ecided to t est fully 
automatic pipelines on these high-quality data. In our early work (jPrsal 120031 ) we ob- 
tained encouraging results for 5 morphologically different EBs, stressing importance of 
data diversity - photometric data without RVs does not suffice for accurate modeling 
results. Trying to follow up on our devised scheme, we soon identified main deficien- 
cies of the DC algorithm: since it is based on numerical derivatives, it may frequently 
diverge, and it gets stuck in local minima. To overcome this, we pro posed two types 
of der ivative-less methods: Nelder & Mead's downhill Simplex me thod ( Prsa fc Zwitterl 
2005c]) and Powell's Direction set method (jPrsa fc Zwitterl l200rj) . To understand and 
explore parameter dege neracy, heuristic scanning and parameter kicking were introduced 
(jPrsa fc Zwitter 2005a . c.f., Figure [5l right) - the problem does not lie in the DC, but 
in the inverse problem itself: its non-linearity, parameter degeneracy and data quality 
limitations. With this in mind w e created a new modeli ng environment called PHOEBLtj] 
(PHysics Of Eclipsing BinariEs; IPrsa fc Zwitterll2005al ) that features a flexible scripting 
language. This language is developed specifically with modeling and analysis of large 
surveys in mind. 

Con tinuing with the OGLE data harvest . IWvrzvkowski et al ( 2003 ): Wvrzvkowski etaTI 
(|2004l) identified 2580 EBs in the LMC and 1351 EBs in the SMC. The novelty of their 
classification approach is using Artificial neural networks (ANN) as an image recogni- 
tion algorithm, based on phased data curves that have been converted to low-resolution 
images as depicted on Figure [H Their classification pipeline was backed up by visual 
examinations of results. Although there were no physical analyses in their pipeline, ob- 
servational properties of the sample, as well as 36 distance estimator candidates for the 
LMC, have been d erived. 

In 2005, iDevorl implemented a tier-based elimination pipeline: observed light curves 
are sequentially passed through filters in the order of increasing computational time cost. 
Each tier filters out light curves that do not conform to the given criteria. Once a clean 
sample of light curves is available, it is submitted to a central part of the pipeline, a dedi- 



f More information on PHOEBE may be found at http : //phoebe . f iz .uni-lj .si 
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Figure 7. Examples of Monte-Carlo simulations for three EB light curves. Most importantly, 
the authors stress the importance of examining phase space cross -sections depicted on the righ t. 
Please refer to the original paper for further details. Taken from Michals ka fc Pigulskil (|2005f ) . 



cated program DEBiL (Detached eclipsing binary light curve solver; c.f., FigureO right), 
fitting a simplified EB model (spherical, limb-darkened stars on a classical Keplerian or- 
bit) to observations. The pipeline yielded 10 861 eclip sing binaries out of 218 699 bulge 
field variables from OGLE II data (Udals ki et al. 1997). Its main advantage is speed: ~ 1 
minute per light curve on a 333MHz Sun UltraSparc 5 workstation. Its main deficiencies 
are lack of the 3rd proposed tier (accurate analysis) and an overly-simplified model that 
may produce false positives among other variables. 

One of the best pa pers, in our opinion, that dea lt with eclipsing binaries from OGLE II 
data, was the one bv Michalska fc Pigulskil (|2005l) . Its thorough analysis and deep insight 
into caveats of the EB field make it exemplary for all similar undertakings in the future. 
The authors limited their analysis to bright (V < 17.5, V— I < 0.5), high S/N, EA type 
binaries that exhibit small proximity effects. After proving by example that the original 
differential image analysis (DIA) calibration is flawed due to uncertainty of reference flux 
in the flux-to-magnitude calibration, they proposed a novel method of calibrating DIA 
data and demonstrated its significantly better results. Once the OGLE II data has been 
re-calibrated, the authors added MACHO, OGLE I and EROS data (when available). 
The data have been submitted to a WD-based pipeline: the first step was to find initial 
parameter estimates by the Monte-Carlo method (c.f., Figure[7]), and the second step was 
to converge to the final solution by DC. A result is a list of 98 proposed candidates for 
distance estimates to the LMC, along with accurately determined parameters in relative 
units. Out of the sample, 58 stars are found to have eccentric orbits, and 14 systems are 
exhibiting apsidal motion. 

Out of the crowd emerges yet anot her program to tackle the problem: EBAI (Eclipsing 
Binaries with Artificial Intelligence; iDevinnev et al. 2005 ). This project does not only 
classify the data, it does more: blindingly fast, it determines coarse parameters of eclipsing 
binaries in a large data set. Study is underway for these parameters to be fed to a WD- 
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based solver within PHOEBE. This solver maps the hyperspace around the solution, 
verifying its uniqueness and heuristically determining error estimates. 

Another recent w ork that we wish to draw specific attention to has been done by 
Tam uz et The authors devised a new algorithm called EBAS (Eclipsing Bi- 



nary Automatic Solver), aime d specificall y to large datasets and thus based on the faster, 
yet less accurate EBOP code ( Etzellll98lh . Similarly to the discussed predecessors, EBAS 



also uses the sum of relative radii as a principal parameter. Yet there are two important 
novelties of their approach: instead of inclination the authors introduced the impact pa- 
rameter - the projected distance between the centers of the two stars during the primary 
eclipse, measured in terms of the sum of radii - and they introduced a new "alarm" statis- 
tics, the goal of which is to automatically discriminate best-fit % 2 values from still appar- 
ently acceptable values, but corresponding to distinctively wrong solutions. A follow-up 
application of EBAS on 938 OGLE LMC binaries with B-type main-sequence primary 
stars ( Mazeh et ali\2006(h yielded the distributions of the fractional radii of the two com- 



ponents and their sum, the brightness ratios and the periods of the short-period binaries. 
Intriguingly, they observed that the distribution in logP is flat on the 2-10 days interval 
and that the detected frequency of their target stars is significantly smaller than the 
fre quency deduced by d edicated RV surveys. The details on these findings are also given 
bv lMazeh efall (|2006bh . 



Our attempt to preserve paper readability, and struggling against page limits at the 
same time, regrettably prohibits us to summarize all the work done so far. That is 
why we wish to at least acknowledge other important developments of this field - and 
to apologize for any unintentional omissions in this brief review. Reader interested in 
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4. Traditional observations are not obsolete 

After so much stress on surveys, missions and sophistication in fully automatic ap- 
proaches it is tempting to conclude that traditional object-by-object observations have 
become obsolete. This is one of most dangerous misconceptions, apparently powered even 
by our own statement in the introduction that most (if not all) of the candidates have 
already been observed a number of times. Although these hot topics are appealing be- 
cause of shear numbers of observed objects, there are several deficiencies in the context 
of eclipsing binaries that we should be aware of: 

• Surveys and missions have a limited life-time that is generally not governed by 
the eclipsing binary harvest. Rather, limitations arise on account of funding, technology 
and reaching primary scientific objectives. A direct consequence is the selection effect in 
observed EBs: only the ones with suitable periods will have been detected. 

• The main driving idea of surveys is to acquire as much data as quickly as possible. 
Due to adopted sky scanning laws, the sky coverage is typically non-uniform and the 
observations are thus clustered in time. Although this might not seem too important for 
close binaries, it is critical in case of well detached binaries where there is practically no 
surface deformation and where eclipses occur only on a narrow phase interval. Having a 
point or two within the eclipse is hardly any different than having no point at all. 

• In order to reach survey completeness in terms of object counts during the mission 
life-time, the number of data points per object is usually poor. This means that the 
phase coverage for eclipsing binaries is often not sufficient for recognition and classifica- 
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Figure 8. Phased light curve of ip Cen. Data points from WIRE (black dots) may b e compared to 
those from SMEI (gray phis symbols) and from Hipparcos (grey circles). Taken from Bruntt et al. 
(|2006T ). 

tion purposes, because of the strong sensitivity of period detection algorithms to phase 
completeness. 

• Unprecedented in numbers does not mean unprecedented in accuracy. At least so far, 
survey data has been more challenging to reduce and analyse than a typical dedicated 
observation be cause of the significa nt data scatter. However, WIRE (the Wide-field Infra- 
Red Explorer. Irlacking et al. 19991 ). despite its failure to achieve primary scientific goals, 
had already given us a snapshot of the milli-magnitude photometric precision of the 
future (c.f., Figure [H]). 

• Most importantly: surveys usually lack data diversity. In order to get absolute tem- 
peratures of both stars and interstellar extinction, multi-passband photometry is needed; 
to get reliable estimates of absolute sizes of an eclipsing binary system, radial velocities 
are needed. To break inter-parameter correlations and solution degeneracy, as many di- 
verse data-sets as possible are needed: astrometry and parallaxes, photometry, polarime- 
try, spectroscopy — the more the better. Different physical and geometrical parameters, 
and their inter-dependencies, are revealed by different types of data-sets. 

If we take all of the above into account, we may only conclude that follow-up observations 
are still badly needed. 



5. Conclusions and prospects for the future 

Overwhelming data quantities are upon us and changing traditional ways of modeling 
and analysis of eclipsing binaries is thus inevitable. There are many fine studies that 
bring us closer to this goal. One of the greatest properties of astronomy, when compared 
to other sciences, is a strong sense of collaboration, and absence of blind competition, 
between astronomers; our questions, therefore, on how to facilitate and how to propagate 
the idea of joint development of these new approaches, and how to handle huge data-sets 
that are pouring in, are aimed at every single individual interested in contributing its 
own piece to this fascinating puzzle. These are indeed scientifically challenging times and 
it would be too bad if we missed them. 
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